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Abstract —We propose pilot reuse (PR) in single cell for massive 
multiuser multiple-input multiple-output (MIMO) transmission 
to reduce the pilot overhead. For spatially correlated Rayleigh 
fading channels, we establish a relationship between channel 
spatial correlations and channel power angle spectrum when 
the base station antenna number tends to infinity. With this 
channel model, we show that sum mean square error (MSE) 
of channel estimation can be minimized provided that channel 
angle of arrival intervals of the user terminals reusing the pilots 
are non-overlapping, which shows feasibility of PR over spatially 
correlated massive MIMO channels with constrained channel 
angular spreads. Regarding that channel estimation performance 
might degrade due to PR, we also develop the closed-form robust 
multiuser uplink receiver and downlink precoder that minimize 
sum MSE of signal detection, and reveal a duality between them. 
Subsequently, we investigate pilot scheduling, which determines 
the PR pattern, under two minimum MSE related criteria, and 
propose a low complexity pilot scheduling algorithm which relies 
on the channel statistics only. Simulation results show that the 
proposed PR scheme provides significant performance gains over 
the conventional orthogonal training scheme in terms of net 
spectral efficiency. 

Index Terms —Pilot reuse, massive MIMO, multiuser MIMO, 
pilot scheduling, robust transmission. 

I. Introduction 

ASSIVE multiple-input multiple-output (MIMO) trans¬ 
mission employs a large number of antennas at the 
base station (BS) to serve a relatively smaller number of 
user terminals (UTs) simultaneously 0. With the potential 
large gains in spectral efficiency and energy efficiency, massive 
MIMO is a promising technology that the next generation of 
wireless systems may incorporate, and has received tremen¬ 
dous research interest recently a, a. 

Channel state information (CSI) at the BS plays an im¬ 
portant role in massive MIMO transmission, and in realistic 
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systems it is typically obtained with assistance of the pe¬ 
riodically inserted pilot signals 0. In time-division duplex 
(TDD) massive MIMO transmission, CSI at the BS can be 
obtained from uplink (UL) training via leveraging the channel 
reciprocity 0, 0. For the conventional orthogonal training 
(OT) scheme 0, the pilot overhead is proportional to the 
number of the UT antennas. As the UT antenna number grows, 
the heavy pilot overhead decreases the system efficiency 
greatly and can become the system bottleneck. 

In order to reduce the pilot overhead, we propose pilot reuse 
(PR) in single cell for massive MIMO transmission in this 
paper. The motivation stems from that, in realistic outdoor 
wireless propagation environments where BS is located at 
an elevated position, the scattering around the BS is usually 
limited, and the MIMO channels are not spatially isotropic 
0, 0, i-e., most of the channel power lies in a finite number 
of spatial directions compared with the whole massive MIMO 
channel dimension. For UTs with channels lying in almost 
orthogonal spatial directions, PR is feasible and beneficial. 

In the proposed PR scheme, massive MIMO transmission 
consists of the following phases: statistical CSI acquisition for 
pilot scheduling, UL training for channel estimation, UL data 
transmission, and downlink (DL) data transmission. The pilot 
scheduler at the BS determines the PR pattern, and allocates 
the available pilot signals to the UTs. Due to the slow-varying 
nature of the long term channel statistics, it is reasonable 
to exploit the statistical CSI at the BS to perform pilot 
scheduling. With the resulting PR pattern, the UTs transmit 
the respective assigned pilot signals periodically to enable the 
BS to obtain the channel estimates. The channel estimation 
performance might degrade due to PR, thus it is natural to 
design the UL and DL data transmissions robust to the channel 
estimation error. 

In this work, we consider the spatially correlated Rayleigh 
fading channels, and show that when the BS antenna number 
tends to infinity, eigenvectors of the channel covariance matrix 
are determined by the BS array response vectors, while eigen¬ 
values depend on the channel power angle spectrum (PAS), 
which reveals a relationship between channel spatial correla¬ 
tions and channel power distribution in the angular domain. 
For this channel model, we show that sum mean square error of 
channel estimation (MSE-CE) can be minimized, provided that 
channel angle of arrival (AoA) intervals of the UTs reusing the 
pilots are non-overlapping, which shows feasibility of PR over 
spatially correlated massive MIMO channels with constrained 
channel angular spreads (ASs). Regarding that channel esti¬ 
mation performance might degrade due to PR, we investigate 
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robust data transmissions for both UL and DL with channel 
estimation error due to PR taken into account. The closed- 
form robust multiuser UL receiver and DL precoder which 
are applicable to arbitrary PR pattern based on the minimum 
MSE of signal detection (MMSE-SD) criterion are developed, 
and an interesting MMSE duality between them is revealed. 
Subsequently, we study pilot scheduling under two MMSE 
related criteria, and in both cases the designs are formulated 
as combinatorial optimization problems. We show that both 
criteria can be optimized provided that channel AoA intervals 
of the UTs reusing the pilots are non-overlapping, and propose 
a low complexity pilot scheduling algorithm (called the sta¬ 
tistical greedy pilot scheduling [SGPS] algorithm) motivated 
by the channel AoA non-overlapping condition. Simulation 
results show that the proposed PR scheme provides significant 
performance gains over the conventional OT scheme in terms 
of net spectral efficiency. 

Related Works and Our Contributions: Most of the previous 
works assumed pilot reuse among cells for massive MIMO 
transmission, where UTs in the same cell use orthogonal pilots, 
while the same set of orthogonal pilots is reused among cells 
0, E), (HD. It has been shown that pilot contamination ITO) 
caused by inter-cell pilot reuse can degrade the performance 
of massive MIMO transmission. In order to mitigate pilot 
contamination, several approaches including, e.g., coordinated 
channel estimation ED, time-shifted pilot allocation CD, 
eigenvalue decomposition based blind channel estimation EH, 
cooperative pilot contamination precoding El, and distributed 
MMSE precoding El were proposed, respectively. In contrast 
to these existing works where pilot overhead was simply set 
to be fixed, our work focuses on reducing the pilot overhead, 
and in the meanwhile, balancing the tradeoff between the pilot 
overhead and the pilot interference^] Specifically, we propose 
PR among UTs, where the required number of orthogonal 
pilots can be much smaller than the number of UTs in a cell, 
by allowing that different UTs in a cell share the same pilot. 
We investigate pilot scheduling, channel estimation, and robust 
UL and DL transmissions under the systematic PR framework, 
which, has not been thoroughly addressed in the literature yet. 
Compared with existing works which dealt with inter-cell pilot 
contamination, our work is more general in the sense that 
we deal with both the pilot interference and pilot overhead 
under the PR framework. In this paper, our analysis focuses 
on the single-cell scenario for the sake of clarity. Regarding 
the multi-cell scenario where pilot contamination persists ED, 
performances of the proposed PR scheme will depend on 
the system configurations. For example, if statistical CSI 
coordination among cells is possible, i.e., channel covariance 
matrices can be exchanged among cells, pilot scheduling and 
pilot interference (as well as pilot contamination) mitigation 
can be performed jointly via exploiting statistical CSI of the 
intra-cell as well as inter-cell UTs. 

Notations: We use j = \J— 1 to denote the imaginary unit. 
Upper (lower) case boldface letters are used to denote matrices 
(column vectors). In denotes the N x N dimensional identity 

1 Pilot interference caused by intra-cell PR is defined in 03), and can be 
seen as intra-cell pilot contamination. 


matrix, and the subscript is omitted for brevity in some cases 
where it is clear. 0 denotes the all-zero vector (matrix). The su¬ 
perscripts (-) H , (-) T , and (•)* denote the conjugated-transpose, 
transpose, and conjugate operations, respectively. The operator 
diag {x} denotes the diagonal matrix with x along its main 
diagonal, and tr {-} denotes the matrix trace operation. We 
employ [a]j and [A], ■ to denote the z lh element of the vector 
a, and the (i, j) th element of the matrix A, respectively, where 
the element indices start from 1. ||a|| 2 = \Ja H a denotes 
the f^-norm of a, and ||X|| F = i/tr {X ff X} denotes the 
Frobenius norm of X. (a, b) = a ;/ b denotes the inner product 
between a and b. A >- 0 (A ^ 0) denotes that A is 
Hermitian positive definite (semi-definite), and A y B means 
that A B y 0. C MxN denotes the M x N dimensional 
complex vector space. E {•} denotes the expectation operation. 
CAf (a., B) denotes the circular symmetric complex Gaussian 
distribution with mean a and covariance B. S(-) denotes the 
Dirac delta function. |_a;J denotes the largest integer that is not 
greater than x. The notation = is used for definitions, and ~ 
means “be distributed as”. The superscripts “p”, “u”, and “d” 
stand for the expressions related to pilot, UL data, and DL 
data, respectively. 

Outline: The rest of the paper is organized as follows. 
In Section El we investigate the massive MIMO channel 
model, and establish the relationship between channel spatial 
correlations and channel PAS. In SectionQIIl we present PR for 
UL channel estimation and show how PR affects the channel 
estimation performance. We also provide a condition under 
which the MSE-CE can be minimized. In Section IIVI we 
develop the robust multiuser UL receiver and DL precoder 
under the MMSE-SD criterion, and reveal a MMSE duality 
between them. In Section [Vj we study pilot scheduling and 
propose a low complexity pilot scheduling algorithm that relies 
on the channel statistics only. Simulation results are provided 
in Section [VI] and the paper is concluded in Section 1 VIII 

II. Massive MIMO Channel Model 

We consider massive MIMO transmission in the TDD mode 
in single-cell scenario, where the BS with M antennas serves 
K (<C M) single-antenna UTs over frequency-flat fading chan¬ 
nels on a narrow-band sub-carrier. We assume that channels 
vary in time according to the block fading model, where 
channel states stay constant over the coherence block with 
a length of T symbols, and evolve from block to block in an 
independent and identically distributed manner according to 
some ergodic process. 

With the ray-tracing based approach (7), lfT6il . fl7l . the UL 
channel between the M antennas at the BS and the antenna 
of the fc th UT can be modeled as 

^max 

g k = J v (0) g k (9) dO = J v(9)g k (9) dd (1) 

Qmin 

where g k (0) and v ( 6 ) £ C Mxl are the complex channel gain 
function and the BS array response vector corresponding to the 
incidence angle 9, respectively. We assume that ||v(0)|| 2 = 
VM for power normalization. We assume that the channel 
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power seen at the BS is constrained to lie in the angle interval 
A = [f? mm , 0 max ], which can be achieved via placing directional 
antennas at the BS, and thus no power is received at the BS 
for incidence angle 9 ^ A. 

We assume that the channel phases are uniformly dis¬ 
tributed, thus E{gfc} = 0. We assume that chan¬ 

nels with different incidence angles are uncorrelated, i.e., 
(0) 9k ( 0, )l = PkSk (9) 5(9 - 9') where /3 fc represents 
the large scale fading, and S k ( 9 ) represents the channel PAS 
which models the channel power distribution in the angular 
domain ED- Then from ©, the channel covariance matrix 
(BS spatial correlation matrix) is given by 

e m “ 

R* = E {skSk } = pk j v (9) v H (9) Sk (9) d 9. (2) 

0min 

We assume that 5k (9) d 9 = 1, and channel power 
normalization should be satisfied as 

gmax 

tr {Rfc} = /3fcM J Sfc (9) d9. (3) 

gmln 

A specific property of the massive antenna array is its high 
resolution to the channels in the angular domain II1 9H . and we 
introduce an assumption about it in the following. 

Assumption 1: 42(71/ Array response vectors corresponding 
to distinct angles are asymptotically orthogonal when the BS 
antenna number tends to infinity, i.e., for V£,i7 G A. 

J im tt(v(0, v W> ( 4 ) 

M->oo M 

Note that Assumption [j] is valid for uniform linear array 
(ULA) as one shall see in Remark[[] Based on this assumption, 
we can obtain the following result on massive MIMO channel 
covariance matrix. 

Lemma 1: Let 

V = VM ^ ^ ^°^ ’ v ^ ’ • • • ’ v (V’M-i))] (5) 

[ffc] m = PkM ■ Sfc (9 (lp m -l)) [$ (i>m) - 9 (V’m-l)] , 

for m=l,2, ...,M (6) 

where ip m ' = m'/M for m' = 0,1,..., M, and 9 = 9 (ip) 
over the support [0,1] is a strictly increasing continuous 
functioijl that satisfies 9 (0) = 9 mm and 9 (1) = (9 max . Then 
under Assumption [Q matrices V ;/ V and Rfc tend to be 
the identity matrix and Vdiag {rfc} Y H , respectively, when 
M —> oo, in the sense that, for fixed positive integers i and j, 

lim [V H V-I M ] i7 . = 0 (7) 

M-koo L J % <3 


lim [Rfc — Vdiag {r fe } V fl ] = 0. 

M—too 

Proof: See Appendix lAl 


( 8 ) 


2 The function (•?/;) can be interpreted as a mapping from the space domain 
to the physical angle domain, and it indeed depends on the BS array structure. 
We assume the function $ (■?/;) to be strictly increasing and continuous over 
the support to guarantee that the function is a one-to-one mapping. 


The result in Lemma Q] indicates that, when the BS antenna 
number M is sufficiently large, the channel covariance matrix 
can be well approximated by 

Rfc « Vdiag {r fe } V H . (9) 


Note that the matrix V tends to be unitary when M is 
sufficiently large. This establishes a relationship between chan¬ 
nel spatial correlations and channel power distribution in the 
angular domain. Specifically, for massive MIMO channels, 
eigenvector matrices of the channel covariance matrices for 
different UTs tend to be the same, and are determined by the 
BS array response vectors, while eigenvalues depend on the 
respective channel PASs. 

Remark 1: When BS is equipped with the ULA, and the 
M antennas are spaced with a half wavelength distance, the 
array response vector can be represented as m 


v(6>) 


l,exp(-j7rsin (9 )),..., 


T 


exp (—jn(M — 1) sin (9)) 


00 ) 


We assume that the AoA interval equals A = [—7t/2, 7t/ 2], 
and it is not hard to show (Q]) in Assumption [Q i.e.. 
Assumption |T| is valid in this case. Let 9 = 9 (ip) = 
arcsin (2ip — 1), then 9(ip m > ) = arcsin ^2™— lj for 

m! = 0,1,..., M, and elements of V reduce to [V]j • = 
exp {-Jilt ^ for i = 1,2,...,M and 
j = 1,2,..., M. This indicates that, for the ULA case, when 
M is sufficiently large, eigenvector matrix of the channel 
covariance matrix can be well approximated by the unitary 
discrete Fourier transform (DFT) matrix (up to some matrix 
elementary operations). Similar channel covariance matrix 
decomposition for the ULA case was derived in ifTTIl and 12TI . 
however, the result in Lemma [j] applies to the more general 
BS array configurations. In addition, the relationship between 
eigenvalues of the channel covariance matrix and channel PAS 
is established in Lemma [T] ■ 

The channel model proposed in Lemma [Q as well as the 
ULA case in Remark [T| is based on Assumption |T| where 
angular resolution of the antenna array is assumed to tend to 
infinity when the antenna number grows to infinity. It is well 
known that angular resolution of an array is proportional to 
the array size □ , 0. Thus, the channel model in Lemma 
Q] is applicable for arbitrary antenna array configuration with 
sufficiently large array size. However, in practical wireless 
communication systems, the antenna array size is always finite. 
Nevertheless, for a fixed array size, a fairly large number of 
antennas can still be accommodated at the BS if wireless 
transmission is performed over higher carrier frequency in, 
e.g., millimeter wave massive MIMO systems l22l . For ex¬ 
ample, considering the case where the antenna number equals 
128 and the carrier frequency equals 30 GHz, which lies in 
the millimeter wave spectrum, the size of the ULA with half 
wavelength spacing considered in Remark Q] is just 0.64 m. 
Note that the channel model in Lemma [T| for the ULA case 
has been shown as a good approximation with finite but large 
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number of antennas HD, ED- For these reasons, the proposed 
channel model is of great importance from both practical and 
theoretical perspectives. 

In this paper, we employ the widely accepted assumption 
that channels are wide-sense stationary |8), thus channel co- 
variance matrices can be obtained by the BS. However, station- 
arity of the realistic wireless channels can only be satisfied in a 
local manner, i.e., channel covariance matrices also vary over 
time. Thus, it requires that the channel covariance matrices 
being periodically estimated at the BS. Estimation of massive 
MIMO channel covariance matrices is rather challenging and 
resource-consuming (23) ■ However, from the result given by 
Lemma 1, only the eigenvalues rather than the whole massive 
MIMO channel covariance matrices need to be estimated, thus 
the number of parameters to estimate can be significantly 
reduced. In addition, the channel covariance matrices vary 
much less frequently than the instantaneous CSI, and thus 
can be estimated via averaging over time. Furthermore, the 
channel covariance matrices have been shown to stay constant 
over a wide frequency interval ||24) . and thus can be estimated 
via averaging over frequency in practical wideband systems. 
Therefore, there will be enough time-frequency resources to 
estimate the channel covariance matrices, and the estimation 
accuracy can be guaranteed in practice. In the rest of the paper, 
we will assume that the channel covariance matrices of all the 
UTs are known by the BS. 

We assume that the channel elements to be jointly Gaussian 
from the law of large numbers, i.e., g/, : ~ CAf (0. R/. ). We 
assume that channels of different UTs are mutually statistically 
independent, and denote the UL channels of all the UTs as 

G=[gi,..., g /r]eC Mxif . 

III. PR for UL Channel Training 

In this section, we present PR for UL channel training, 
and investigate how PR affects the channel estimation perfor¬ 
mance. Our following analysis applies to arbitrary PR pattern, 
while how to form the PR pattern exploiting the statistical CSI 
will be discussed in Section [V] 

We denote the UT set as 1C = {1,2where k £ 
K, is the UT index. We assume that the UL training interval 
length equals r(< /\ )0 and all the UTs transmit the respective 
pilot sequences in the length of r simultaneously during the 
training interval^ Note that the maximum number of available 
orthogonal sequences is equal to the sequence length, and we 
assume that the available orthogonal pilot sequence number 
equals r for simplicity. We denote the available orthogonal 
pilot set as T = {1,2, and the 7r th pilot sequence 

as x_ £ <D TXl where n £ T is the orthogonal pilot index. 
Different pilot sequences are assumed to satisfy the orthogonal 

! ll should be noted that the results obtained in the following are applicable 
for arbitrary r, and r can be either set to be a fixed number, or determined 
dynamically by the BS. One example of how to dynamically determine r 
based on the net spectral efficiency maximization criterion will be discussed 
in Section [VI-CI 

4 For the case that one UT sends pilot signals during several particular 
channel uses while remains silent during other channel uses, the pilot signals 
can be seen as a specific pilot sequence with several non-zero entries at the 
corresponding channel uses. 


condition that x^x T < = ruf ■ S (n — tt') where erf is the pilot 
signal transmit power. 

We denote an arbitrary PR pattern with UT set K, and pilot 
set T as V{1C,T) = {(fc,7r fe ) : k £ 1C, n k £ T} where 
(fc,7Tfc) £ V(1C,T) denotes that the 7r* pilot sequence x nk 
is allocated to the fc th UT. We use 1C^ = {k : n k = it} to 
denote the set of the UTs using the 7r th pilot sequence. 

With the PR pattern VOC,. T), the UTs transmit their as¬ 
signed pilots periodically to enable the BS to estimate the 
channels. During the UL training phase of each coherence 
block, the received pilot signals at the BS can be written as 

Y = GX + N<eC Mxt (11) 


where G is the UL channel matrix, X = 
[x Wl ,x^. 2 ,... ,x^. k ] T £ <C Kxt is the UL pilot signal 

matrix, N is the independent additive Gaussian noise matrix 
with elements distributed as independently and identically 
CAf [0, of}, and erf is the noise power during the training 
phase. After decorrelation and power normalization of 
the received signals E). the BS can obtain the channel 
observation of all the UTs. Specifically, for the A: lh UT in 
a given coherence block, the BS obtains the UL channel 
observation as 


k i 

= EE Se ■ S (ire ~ tt k ) + — N 4 k 
i=i <TxT 

= X b + 

Ot- A 


( 12 ) 


With the property of unitary transformation, it is not hard 
to show that the noise term -J—Nx* in (fl2l) is still Gaussian 

<7x T ~k H - * 

with elements distributed as independently and identically 
CjV fo, Let p p = erf/erf be the UL channel training 

signal-to-noise ratio (SNR), then ( 1 1 2| i can be rewritten as 


y 


p 


(eic *k 


i 

VWt 


n 


p 

TTfc 


g k+ & 

(&IC„ k \{k} 




pilot interference 


pilot noise 


(13) 


where “\” denotes the set subtraction operation, and n^ fc ~ 
CAf (0,1m) is the normalized additive noise. Note that 
represents the set of the UTs using the same pilot as the fc th 
UT, and the BS has to estimate the channels of all the UTs 
reusing the 7r* pilot based on the observation yf (; .. The MMSE 
estimate of the channel based on the channel observation 
yl k is given by 

g k = RfcC-iyP, (14) 


where 


= X R < + 



pPr 


(15) 


From the orthogonality principle of MMSE estimation E3, 
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channel estimation error g/,. = — g/,. is independent of gfc, 

and the covariance of g/. is 


= TL k - RfcC^Rfc. (16) 


Note that g/, : and Rg fc are also mean and covariance of g/, 
conditioned on y p fc , respectively ED. 

The estimation error covariance is an important measure of 
the estimation performance, and we define the MSE-CE as 


e P = £tr { R t J. 
fc=1 


(17) 


Before we proceed, we first define the orthogonality be¬ 
tween two arbitrary Hermitian positive semi-definite matrices 
using the angle (0 < 9 < tt/2) between them as 


9 (A, B) = arccos 


tr{A ff B} tr{AB} 

l|A|| F ||B|| F - arccos || A|| F ||B|| F ’ 

for A,B^0. (18) 


Then we present a condition under which the MSE-CE defined 
in (fT7] > can be minimized in the following theorem. 

Theorem 1: The minimum value of the MSE-CE e p is given 
by 

£ p = ^2 tr | Rfc “ Rfe | 

and the minimum is achieved under the condition that, for 

Mi, j £ K, and i ^ j, 

9 (Ri, Rj) = when 7Tj = Wj. (20) 

Proof: See Appendix iBl ■ 

In the MSE-CE metric defined in (fl7T >. correlations between 
the channel estimation errors seen by different UTs are not 
taken into account. Actually, the correlations between the 
channel estimation errors of the UT i and the UT j (j f i) 
can be obtained as 

E {g,gf} = E | ( gi - RiC^yPj ( gj - RjC^yP | 

= —RjC^Rj- ■ 5 (7Ti — TTj) (21) 

which indicates that channel estimation errors of the UTs with 
orthogonal pilots are independent, while those of the UTs 
reusing the pilots are correlated. However, if the condition 
given in Theorem [7] is satisfied, then -RjC” 1 R ; = 0, 
i.e., channel estimation errors seen by different UTs will be 
uncorrelated no matter whether they reuse the pilots or not, 
and the condition given in Theorem |T| is still optimal. 

To obtain clear insights of Theorem [Q we consider the 
asymptotic antenna number case, and the following corollary 
can be readily obtained from Lemma [T] 

Corollary 1: When the BS antenna number M —> oo, the 
MSE-CE e p can be minimized provided that, for Mi,j £ K, 
and i f j, 

(r i ,r j )=0, when -K i = 'Kj (22) 

where r, for fixed positive integer i is given in Lemma |T] ■ 
The result in Corollary Q] indicates that the MSE-CE e p 


can be minimized if the UTs reusing the pilots have non¬ 
overlapping channel AoA intervals. This result is very intu¬ 
itive, as in such cases, the channels of different UTs are strictly 
separated in the angular domain, and the pilot interference 
does not take into effect. Moreover, in the high SNR regime 
where the training SNR p p —y oo, the pilot noise vanishes, 
and then the MSE-CE e p —> 0, which implies that channel 
estimations tend to be perfect. 

Although the conditions in Theorem Q] and Corollary Q] are 
desirable, they cannot always be well satisfied. However, in 
realistic outdoor propagation environments where the BS is 
located at an elevated position, channel AS seen by the BS 
is usually small a, urn which indicates that most of the 
channel power is concentrated in a narrow angle interval, and 
the channel power outside this angle interval is very small. For 
UTs located geographically apart in different spatial directions, 
the overlaps of their channel power in the angular domain 
might be neglected, and thus PR becomes feasible in such 
spatially correlated massive MIMO channels. 

IV. Robust UL/DL Data Transmissions 
In the previous section, we showed feasibility of PR for 
massive MIMO transmission, and presented UL channel train¬ 
ing with PR. In each coherence block, the BS obtains the 
channel estimates of all the UTs after UL channel training. The 
conventional data transmission design in massive MIMO treats 
the channel estimates as the real channels. However, with PR, 
the channel estimation performance will degrade in most cases, 
thus a robust data transmission design with channel estimation 
errors taken into account is of paramount importance in the 
considered PR based massive MIMO transmission. There are 
two main approaches to design a wireless system robust 
to the channel uncertainty: the worst-case approach and the 
statistical approach. In the worst-case approach, the channel 
uncertainty is modeled as being within a given set around the 
channel estimate, and a worst-case transmission performance 
can be guaranteed |[27l . In the statistical approach, the channel 
uncertainty is modeled using the channel statistics, such as the 
mean and the covariance, and a statistical average performance 
can be guaranteed ll28l . In this work, we employ the statistical 
approach to model the channel uncertainty. Specifically, in 
each coherence block, based on the received pilot signals, 
the CSI uncertainty at the BS can be modeled statistically 
using its conditional distribution, i.e., the conditional mean 
(the MMSE channel estimate) and the conditional covariance 
(the covariance of the channel estimation error). Note that the 
channel estimation error covariance R g( . given in (1 1 6b depends 
on the PR pattern V(IC, 7") and the channel covariance, and 
thus can be known by the BS. In the following, we will develop 
robust data transmissions for UL and DL, respectively, under 
the MMSE-SD criterion. 

A. Robust UL Data Transmission 

During the UL data transmission phase, the signal received 
at the BS at a channel use in the given coherence block can 
be expressed as 

yU = GaU + ^ n " = ( 6 + <s ) a " + ^"" (23) 
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where G = [gi, g 2 , ■ ■ ■, gic] is the channel estimate, G = 
[gi,..., gx] is the channel estimation error, a 11 € C Kxl with 
mean 0 and covariance \k denotes the UL data signal vector 
where [a u ] fc is the signal sent by the fc th UT, n li ~ CM (0, Im) 
is the independent additive noise, and p a is the UL data 
transmission SNR per UT. 

We consider the linear receiver at the BS 

a u = W T y u (24) 

and then MSE-SD of the UL transmission in the given 
coherence block can be defined as 

e u = E {||a u — a u 11 2 } (25) 

where the expectation is with respect to a u , n u , and the channel 
estimation error G. 

Finding the optimal UL receiver based on the MMSE-SD 
criterion can be formulated as 


min e u (26) 

w 


and we present the solution in the following theorem. 

Theorem 2: The optimal solution to the problem < [26t is 
given by 


W°P ( 



K 


E^ + 7 1 , 


fc =1 



and the corresponding MSE-SD is given by 


(27) 


i+g ' , (I r “ + e i ) ’ 

(28) 

Proof: See Appendix ICl ■ 

For the conventional receiver with channel estimates as¬ 
sumed to be accurate, the impact of the channel estimation 
error is omitted. While for our robust MMSE receiver design, 
the channel estimation error due to PR is taken into account. 
Specifically, the expectation in ( l25l > accounts for the chan¬ 
nel estimation error G, which leads to our robust MMSE 
receiver. Note that the robust MMSE receiver given in i fTTl i 
exhibits a similar structure to the conventional receiver. When 
Y^n=i iig„ > 0, the robust MMSE receiver in (l27l ) reduces 
to the conventional receiver 



W con 


(gg h + -^i^ g 


(29) 


€ K x 1 with mean 0 and covariance I/ N - denotes the DL data 
signal vector where [a d ] is the signal for the fc th UT, n d ~ 
CM (O. Lk) is the independent additive noise, p d is the average 
DL data transmission SNR per UT, and B is the DL linear 
precoding matrix which satisfies the power constraint 

tr{BB h }<K. (31) 


Then MSE-SD of the DL transmission in the given coherence 
block can be defined as 

e d = E 111 cty d — a d 11 ^ j (32) 

where the expectation is with respect to a d , n d , and G, and a 
is a real scalar parameter corresponding to the potential power 
scaling performed at the UTs@ 

Finding the optimal DL precoder based on the MMSE-SD 
criterion can be formulated as 


min 

B,a: 

subject to 


tr{BB ff } < K 


(33) 


and we present the solution in the following theorem. 

Theorem 3: The optimal solution to the problem (l33l > is 
given by 


B °Pt = _L 


7 


opt 


K 


GG 


H 


5Z + pd 1 


fc=1 


(34) 


a° p t = 7° pt 


(35) 


where 7 ° pt is chosen to satisfy the power normalization con¬ 
straint tr |B opt (B opt ) H } = K, its value is given by 


*" 

G H 

(GG ff +E% + ^ 

'•1 

K 


and the corresponding MSE-SD is given by 


(36) 


i + 6 ''(E r «. + 7 i ) e j 

(37) 

Proof: See Appendix [D] ■ 

From Theorem 0 we can observe that, similarly to the UL 
case, the robust MMSE precoder in (f34l) also embraces the 
structure of the conventional precoder. 



B. Robust DL Data Transmission 

During the DL data transmission phase, the signal received 
at the UTs at a channel use in the given coherence block can 
be expressed as 

y d = G T Ba d + —^n d = (g + gV Ba d + —Un d 

(30) 

where the DL channel G T is the transpose of the UL channel 
due to the channel reciprocity of the TDD systems m, a d G 


C. UL-DL Duality 

From the results in Theorem [2] and Theorem [3] we can 
readily obtain the following UL-DL MMSE duality. 

Corollary 2: In each TDD coherence block, if p u = p A , 
then B opt = W opt / 7 ° pt , and e u - min = e 4 ™". ■ 

The result in Corollary [2] indicates that, in the same TDD 
coherence block, if the UL data transmission SNR equals the 
DL data transmission SNR, then the robust DL precoder in 

5 Note that a is a real scalar, and the overhead for the UTs to obtain it can 
be neglected. 
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( [34} can be achieved by the robust UL receiver in ([27} with 
proper power normalization, and the complexity of computing 
the robust MMSE DL precoder can be reduced. In addition, 
if the robust MMSE receiver and robust MMSE precoder are 
used for data transmissions, then the same MMSE-SD can 
be achieved in both the UL and DL in each TDD coherence 
block. Note that similar UL-DL duality based on the perfect 
CSI assumption was provided in the literature such as l29l 
and ED, however, our result in Corollary [2] is established on 
the pilot-assisted CSI acquisition assumption. 

In this section, we have investigated the robust UL and DL 
data transmissions with channel estimation error due to PR 
taken into account. It will be seen in the following section 
that, PR based massive MIMO transmission, which combines 
pilot scheduling and robust data transmission, can achieve the 
MMSE-SD optimality for both the UL and DL. 

V. Pilot Scheduling 

Up to now, we have investigated channel training and data 
transmission of the massive MIMO transmission with PR, and 
the obtained results are applicable to arbitrary PR pattern. In 
this section, we study pilot scheduling which exploits the long 
term statistical CSI to allocate the available pilot signals to the 
UTs, and we focus on two MMSE related criteria. 


A. MMSE-CE Criterion 

CSI is critical to massive MIMO transmission, and it is 
natural to design the pilot scheduler based on the MMSE-CE 
criterion, which leads to the following problem 

min e p (38) 

V(IC,T) 

where e p is defined in (ITT} . 

The pilot scheduling problem in (138} is combinatorial, 
and the optimal PR pattern V(K,,T) can be found through 
exhaustive search (ES). The complexity of the ES in ([38}, in 
terms of the (complex) scalar multiplication number which 
dominates the computational complexity, is briefly evaluated 
as follows. Recalling ( [XT} , the scalar multiplication number 
required in evaluation of the objective function in (138} is 
0(M 3 K). Thus, the computational complexity of running ES 
under the MMSE-CE criterion is 0(t k M 3 K). 


B. MMSE-SD Criterion 

MSE-SD is an important performance measure of the data 
transmission, and in the sequel we study the pilot scheduler 
design regarding the MSE-SD metric. Due to the UL-DL 
MMSE-SD duality in each coherence block presented in 
Corollary [2] we assume that p u = p d for simplicity, and 
denote that p u = p A = p l and e u ’ mm = e d - min = e tnun, 
where the superscript “t” stands for expression related to data 
transmission. We consider pilot scheduling under the MMSE- 
SD criterion, which can be formulated as 

min E { e 1 ’™" 

V(K,T) 1 



where the expectation is with respect to the channel fading 
and the noise distributions, and the effective noise covariance 
matrix is defined as 

K 1 

Rt.n.effA£ Ri , L (40) 

fc =1 P 

The objective function in (139} is the average of the MMSE-SD 
that can be achieved by the robust MMSE receiver and robust 
MMSE precoder in each coherence block, and it depends on 
the statistics of the channel fading and pilot noise distributions. 
It should be noted that here we still use the term MMSE-SD 
for brevity, however the meaning of it differs from that when 
we consider the designs of the receiver and precoder in the 
previous section. 

Due to the difficulty in obtaining the closed-form expression 
of the objective function E | e t min | in ([39}, we first present a 
lower bound of it in the following lemma. 

Lemma 2: The average MSE-SD E{e t,min } is lower 
bounded by 

E {e 1 ’™"} > e t,alb = tr {(I* + fl) _1 j (41) 

where for fixed positive integers i and j, 

[O]. . = tr {c-iR; (R u ’ eff ) Rj} • <5 (7r* -7 Tj). (42) 

Proof: See Appendix [EJ ■ 

It will be seen in Section IVI-AI that, the lower bound 
presented in Lemma [2] is tight over a wide SNR region. By 
replacing the objective function E {g 1 - 1 ™ 1 } with its lower bound 
presented in Lemma [2] the pilot scheduling problem ( [39} can 
be simplified as 


min e t,alb . (43) 

V(K,T) 

The pilot scheduling problem in (143} is also combinatorial. 
The optimal PR pattern T^/CjT) can be found through ES. 
Note that the scalar multiplication number required in evalua¬ 
tion of the objective function in d43} is 0(M 3 K 2 ), thus, the 
computational complexity of running ES under the MMSE-SD 
criterion is 0{r K M 3 I\ 2 ). 

Before we proceed, we present a condition under which e t alb 
can be minimized in the following theorem. 

Theorem 4: The minimum value of the lower bound aver¬ 
age MSE-SD e t,alb is given by 


£ 


t 


K 


E 


i 

i + Mi 


(44) 


where [c<->]for fixed positive integer i is given by (145} , shown 
at the top of the next page, and the minimum is achieved under 
the condition that, for \/i,j £ K, and i ^ j, 

7r 

(((R^Rj) = when 7U = nj. (46) 

Proof: See Appendix [0 ■ 

Recalling Lemma Q] we can readily obtain the following 
corollary. 

Corollary 3: When the BS antenna number M —> oo, the 
lower bound average MSE-SD e t,alb can be minimized provided 
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Mi = tr 



( Rfe—Rfc 

_fc=i v 






(45) 


that, for \/i, j £ K, and i ^ j, 

( r i, Tj) = 0, when m = -Kj (47) 

where ri for fixed positive integer i is given in Lemma |T] ■ 
Interestingly, conditions for optimal data transmission ob¬ 
tained in Theorem [4] and Corollary [3] are the same as those 
for optimal channel training obtained in Theorem Q] and 
Corollary Q] The intuitive interpretation lies in that, for massive 
MIMO transmission, if channels of the UTs reusing the 
pilots can be rigorously spatially separated, then not only the 
pilot interference but also the transmission data interference 
vanishes. Furthermore, in the high SNR regime where both 
the training SNR p p and transmission SNR p' tend to infinity, 
the remaining additive noise vanishes, and the average MSE- 
SD e l —> 0. This result shows that the PR based transmission 
scheme, which combines pilot scheduling and robust data 
transmission, can achieve the MMSE-SD optimality. 


C. SGPS Algorithm 

In the above subsections, we have investigated pilot schedul¬ 
ing under two MMSE related criteria. In both cases, the 
designs are formulated as combinatorial optimization prob¬ 
lems, and the optimal PR patterns can be formed through ES. 
However, due to the exponential complexity, ES becomes hard 
to implement in practice as the UT number grows. 

In this subsection, we propose a low complexity pilot 
scheduling algorithm called the statistical greedy pilot schedul¬ 
ing (SGPS) algorithm which is motivated by the conditions 
for optimal channel estimation and data transmission given in 
Theorem [j] and Theorem [4] and the main idea is that channel 
covariance matrices of the UTs reusing the pilots should be 
as orthogonal as possible. Detailed description of the SGPS 
algorithm is summarized in Algorithm Q] Coordinated pilot 
allocation algorithm for mitigating the inter-cell pilot contam¬ 
ination using similar idea was proposed in CD, however, the 
above SGPS algorithm is dedicated for the single-cell scenario. 

We evaluate the complexity of the SGPS algorithm as 
follows. In the process of the SGPS algorithm, no more than 
— m) = ( K — l)K(I\ + l)/6 orthogonality 
calculations defined in (fTSl ) are needed. Note that the scalar 
multiplication number needed in each orthogonality calcula¬ 
tion is 0(M 2 ), thus, the computational complexity of running 
the SGPS algorithm is 0(M 2 K 3 ). In above subsections, we 
have shown that the ES complexity under the MMSE-CE 
and MMSE-SD criteria are 0{t k M 3 K ) and 0(t k M 3 K 2 ), 
respectively. This indicates that the SGPS algorithm gives a 
significant computational complexity reduction compared with 
ES. Meanwhile, simulation results in Section [VLB] will show 
that performances of the low complexity SGPS algorithm can 
closely approach those of ES. 


Algorithm 1 Statistical Greedy Pilot Scheduling (SGPS) Al¬ 
gorithm 

Input: The UT set /C = {1,2,... ,K} and the channel 
covariance information R,£ 1C), the orthogonal pilot 
set T with the pilot length r(l < r < K) 

Output: PR pattern V(K.,T) = {(k,7Tk) : k £ /C,7Tfc £ T} 
l: Initialize the unscheduled UT set JC UD = 1C, the unused 
pilot set T™ = T 

Step 1) Schedule the UTs with “similar” channel covari¬ 
ance matrices and assign them with orthogonal pilots 
2: m i = 1, 7Ti = 1, /Ci = {1}, /C un £- /C un \{ 1}, T un ■£- 
T un \ {1} 

3: while T un ^ 0 do 

4: For the pilot t £ T un , select the UT rn t = 

arg max XMt\T“" cos 0 M) 

5: Assign the pilot t to the UT nit, n mt = t, K-t = {m-t} 

6: Update /C un £- /C un \ {m t }, T un <- T un \ {t} 

7: end while 

Step 2) Each unscheduled UT is assigned with the “best” 
pilot so that the channel covariance matrices of the UTs 
reusing the pilots are as orthogonal as possible 
8: while AC™ ^ 0 do 

9: For the UT k £ AC™, select the pilot nk = 

arg min Y^seic COS0 (R fe , R s ) 

9GT q 

10: Assign the pilot nk to the UT k, nk = rik, IC nic <— 

IC nk U {fc} 

li: Update AC un AC un \ {/c} 

12: end while 


VI. Numerical Results 

In this section, we present numerical simulations to evalu¬ 
ate performances of the proposed PR based massive MIMO 
transmission. We assume that the BS is equipped with the 128- 
antenna ULA, and the antennas are spaced with a half wave¬ 
length distance. We set the AoA interval as A = [—7t/2, 7t/ 2]. 
We consider the typical outdoor wireless propagation environ¬ 
ments where the channel PAS can be modeled as the truncated 
Laplacian distribution fl8l . OH given by ( 1481 ). shown at the 
top of the next page, where <7k and Ok represent the AS and the 
mean AoA of the fc th UT’s channel, respectively. We assume 
that channel ASs are the same for all the UTs so that Uk = cr 
(V/c). We assume that all the UTs are of equal distance from 
the BS, and set the large scale fading coefficients as /3k = 1 
(V/c). We assume that the UTs uniformly locate in a 120° 
sector, i.e., the mean channel AoA Ok is uniformly distributed 
in the angle interval [—7 t/3,7t/ 3] in radian. The channel 
covariance matrices of the UTs are generated according to 
the model given by Remark Q] and we impose the constraint 
in (0 for channel power normalization. We assume that the 
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cLap 

D k 


( 0 ) = 


V2a k (l - exp (—v^T^/cfc)) 


• exp 


'-y/2\e-0 k \ 


& k 


for 0 e [dk - 7r, 9k + 7r] 


(48) 


channel training SNR and the data transmission SNR are equal 
such that p p = p u = p d = p. 

A. Performance of Robust Transmission 

In this subsection, we employ the average MSE-SD metric 
to evaluate performances of the robust receiver and precoder 
developed in Section [TV] Due to the UL-DL MMSE duality 
given in Corollary [2] we only consider the UL transmission 
case for brevity. 

We compare performances of the robust MMSE receiver 
given in ( l27l > with those of the conventional receiver given in 
(l29l >. We consider the case with K = 10, er = 10°, and the 
mean channel AoAs of the UTs from UT 1 to UT 10 are 

[0.6592,0.8499, -0.7812,0.8658,0.2772, 

- 0.8429, -0.4639,0.0982,0.9582,0.9737] 

in radian. We assume that the pilot length equals t = 5, and 
consider two PR patterns. Specifically, the pilot indices that the 
UTs use from UT 1 to UT 10 are [1,1, 2, 2, 3, 3,4,4, 5, 5] and 
[1, 2, 3,4, 3, 5,4, 5, 5,3] for PR patterns A and B, respectively]^ 
In Fig. [I] we plot the average MSE-SD performances of 
the robust MMSE receiver (using the true and the estimated 
channel covariance matrices that are obtained via averaging 
over 100 samples, respectively) and the conventional receiver. 
The lower bound of the average MSE-SD achieved by the 
robust MMSE receiver given in Lemma [2] is also shown. We 
can have the following observations: 1) the average MSE- 
SD performance loss using the estimated channel covariance 
matrices compared with true channel covariance matrices can 
be almost neglected; 2) the robust MMSE receiver outperforms 
the conventional receiver, especially in high SNR regime 
where pilot interference dominates; 3) compared with the 
robust MMSE receiver, the conventional receiver is quite 
sensitive to the channel estimation error, and increasing the 
SNR may result in additional MSE-SD for the conventional 
receiver; 4) the closed-form lower bound of the average MSE- 
SD given in Lemma [2] is tight over a wide SNR region for 
different PR patterns; 5) pilot scheduling is crucial to the data 
transmission performance. 

B. Performance of SGPS Algorithm 

In this subsection, we evaluate performances of the SGPS 
algorithm, and compare them with those of ES. In Fig. [2] and 
Fig. |3] we plot the MSE-CE metric in (fT71 > and the average 
MSE-SD metric in (l-iTT ) versus the pilot length for different 
values of SNR with K = 10 and a = 10°, respectively. It 
can be observed that, in both cases the performances of the 

6 PR pattern B is determined by the SGPS algorithm proposed in Section 
Ec] and we arbitrarily set a PR pattern as pattern A for comparison. We 
employ such setting to exemplify that pilot scheduling is crucial to the 
transmission performance. 



Fig. 1. Comparison of the average MSE-SD performances between the 
robust MMSE receiver (using the true and the estimated channel covariance 
matrices that are obtained via averaging over 100 samples, respectively) and 
the conventional receiver. Results are shown versus the SNR for two specific 
PR patterns with K = 10, r = 5 and a = 10°. The lower bound of the 
average MSE-SD achieved by the robust MMSE receiver is also depicted. 



Fig. 2. Comparison of the MSE-CE performances between the SGPS 
algorithm and ES. Results are shown versus the pilot length for different 
values of SNR p with K = 10 and a = 10°. 

SGPS algorithm closely approach those of ES over a wide 
SNR region for different values of pilot length. 

C. Net Spectral Efficiency Comparison 

In this subsection, we compare the net spectral efficiency 
performance between the proposed PR scheme and the con¬ 
ventional OT scheme. The net spectral efficiency is given by 

R net R (49) 
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Fig. 3. Comparison of the average MSE-SD performances between the SGPS 
algorithm and ES. Results are shown versus the pilot length for different values 
of SNR p with K = 10 and a = 10°. 


Fig. 4. Comparison of the UL spectral efficiency performances between the 
PR scheme and the OT scheme. Results are shown versus the coherence block 
length T for different values of AS a with K = 10 and p — 20 dB. 


and the achievable rate R ach can be set as the UL sum 
achievable rate f? u,sum , or the DL sum achievable rate i? d,sum , 
or the weighted summation of f? u,sum and f? d,sum . The UL sum 
achievable rate f? u,sum 10, E2 is given by (l50t . shown at the 
top of the next page, where w/- is the fc th column of the UL 
receiver matrix W given in ( f27| i. The DL sum achievable rate 
f?d,sum 0, ifTOl is given by ( 15 1 [ i. shown at the top of the next 
page, where 1> /,. is the k th column of the DL precoding matrix 
B given in dm . and a is the power scaling performed at the 
UTs given in (l35l >. 

For the PR scheme, we consider a dynamic pilot length 
strategy. Specifically, for a given UT set, the achievable rates in 
(l50l > and (|5H can be obtained for arbitrary pilot length t(<K) 
with pilot scheduling performed by the SGPS algorithm, and 
then the optimal pilot length and the net spectral efficiency 
can be obtained. While for the OT scheme, the pilot length 
r is set as r = K if K < T/2, or r = |T/2J if K > T /2 
where only [T /2J UTs are serviced simultaneously 0. 

The UL net spectral efficiency performances of the PR 
scheme and the OT scheme are compared in Fig. [4] and 
Rg. a while the DL net spectral efficiency performances are 
compared in Fig. a and Fig. [7] with K = 10. It can be 
observed that, the proposed PR scheme shows performance 
gains over the conventional OT scheme in terms of the net 
spectral efficiency, and the gains become larger as the channel 
AS becomes smaller. Moreover, in the high SNR regime where 
the pilot interference dominates, and in the small coherence 
block length regime where the pilot overhead dominates, the 
proposed PR scheme provides significant performance gains. 
Specifically, for the case with K = 10, a = 2°, p = 20 dB and 
T = 20, the proposed PR scheme provides approximately 35 
bits/s/Hz net spectral efficiency gains over the conventional 
OT scheme for both the UL and DL data transmissions. 

VII. Conclusion 

In this paper, we proposed pilot reuse (PR) in single cell for 
massive MIMO transmission to reduce the pilot overhead. We 



Fig. 5. Comparison of the UL spectral efficiency performances between 
the PR scheme and the OT scheme. Results are shown versus the SNR for 
different values of AS a with K = 10 and T = 20. 


exploited the fact that, in realistic outdoor wireless propagation 
environments where the BS is located at an elevated position, 
most of the channel power lies in a limited number of spatial 
directions compared with the whole massive MIMO channel 
dimension, and thus PR among UTs of spatial localization 
becomes feasible and beneficial. We first established the rela¬ 
tionship between the channel covariance matrix and the chan¬ 
nel PAS for the massive MIMO channels. Then we showed 
that, if channel AoA intervals of the UTs reusing the pilots 
are non-overlapping, then MSE of the channel estimation 
can be minimized. We also developed the robust multiuser 
UL receiver and DL precoder with the channel estimation 
error due to PR taken into account, and revealed the UL-DL 
MMSE duality between them. Moreover, we presented pilot 
scheduling under two MMSE related criteria, and proposed 
a low complexity pilot scheduling algorithm motivated by 
the channel AoA non-overlapping condition. The simulation 
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i?u,sum = ^ E ji 0 g 2 
fc= 1 [ 

i? d,su m = ^ log2 1 + 

fc=1 \ 


1 + 



l w fcgfc| 

2 

1 

Sra&m 


1 w fc 


i 

E{agfcb fc } 

2 

yf 

/m=l j 

[a 2 

|gfe b m 

I'] 

b 

E{ag^b fc } 

| 2 + iE{a 2 } 


(50) 


(51) 



Fig. 6. Comparison of the DL spectral efficiency performances between the 
PR scheme and the OT scheme. Results are shown versus the coherence block 
length T for different values of AS a with K = 10 and p = 20 dB. 



Fig. 7. Comparison of the DL spectral efficiency performances between 
the PR scheme and the OT scheme. Results are shown versus the SNR for 
different values of AS cr with K = 10 and T = 20. 


Appendix A 
Proof of Lemma Q] 

From the definition of V in ©. we have 


lim [V H V-I 


M—t oo 


Ml 


h3 


= Jim "tt (v (0 (V>t- i))) H v (0 (ipj- 1 )) -S(i- j ) 

M—>■ oo lVl 


^5(i-j)-6(i-j) = 0 


(52) 


where (a) follows from Assumption |T] and 0( , 0) is a strictly 
increasing function. This concludes the proof of ©. 

The proof of <(8]» can be obtained as 

lim [R fc - Vdiag {r fc } V ff ] 

M-¥ oo hJ 

= lim [R fc ]j ■ - 

M->oo 


lim 

M—> oo 


1 


M 


M 51 [ r fc]m V (^W’m-l))v ff (0(t/> m _i)) 
m=l 

M 

= lim [R k\ id ~ Pk lim t v 1))]* 

M—>oo J oo z — 

m=l 

‘ [V (# Sfc (tf (^m-l)) [# Wm) - $ (V’m-l)] 

6>max 

= J [v(0)Uv(0)]*S fc (0)d0 




- Pk 


V (0 (VO)]* [v (0 (V , ))]j Sfe (tf WO) dd (r/>) 

( % / [v(0)Uv(0)]*S fe (0)d0 


o) 

a max 


^max 


-A J [v (0)], [v (0)]* S fe (0) d0 = 0 (53) 

^min 

where (a) follows from ©, (b) follows from ((6|. (c) follows 
from © and the integral definition, (d) follows from that 

t? (ip q) = 0 (0) = 0™" and 0 (ip M ) = 0(1)= 0 max . 


results showed significant performance gains of the proposed 
PR scheme over the conventional OT scheme in terms of net 
spectral efficiency. 


Appendix B 
Proof of Theorem|7] 

We start by presenting a lemma that is required in the 
following proof. 

Lemma 3: For A ^ 0 and B ^ 0, 0 (A, B) = 7 t/ 2 is 

equivalent to AB = 0. 
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Proof: Recalling ( fl8l ), 0(A, B) = 7r/2 is equivalent to 
tr{AB} = 0. Furthermore, tr {AB} = 0 is equivalent to 
AB = 0 for A ^ 0 and B ^ 0 01 Prop. 4.26]. This 
concludes the proof. ■ 

Now we proceed with the proof of the theorem. Due to the 
positive semi-definiteness of the covariance matrix, we can 
obtain 


tr {Rg fc } = tr {R fc - RfcC^Rfc} 


-1 


= tr ^ R/c — Rfc [ ^ Rt + —I J Rfc 

> tr | R fc — R fc ^R fc + Rfc | (54) 

From Lemma 1 which states that 0(Rj,Rj) = 7r/2 is 
equivalent to R,Rj = 0, we can obtain 

C„R fc = | V R< + 2-l] R fc 

p T J 

= (^Rfc + — Rfc = Rfc ^Rfc + (55) 


which indicates that 


1 

p p r 


-l 


C,’Rfc — Rfc ( Rfc 4——I ) — ( Rfc + —) Rfc- 


-l 


Substituting (l56l> into (fT6l >. we can obtain 


tr{Rg fc } = tr |R fc - Rfc |^Rfc + —IJ Rfc j (57) 

which achieves the minimum in (l54l >. This concludes the proof. 


Appendix C 
Proof of TheoremH 


The MSE-SD defined in (1251) can be simplified as 
e u = tr j W T |gG h + J2 R i* + 7 1 ) W * 

+ 1-w t g g h w* l. 


Note that e u is convex with respect to W. 

By setting the derivative of e u with respect to W* l34l to 
zero, 

, T 


A, we can obtain 


W opt = 


K 


GG ff + ^Rg fc + -I G 


fc=i 


(60) 


Substituting (l60l > into (l58l) . we can obtain the corresponding 
MSE-SD as 

e u -min = tr 1 1 - G^ ( GG H + Rg fc + AA G 


k=1 


( a ) f 
= tr 


K 


-n 


I + G H E R i fc + ^ 


\k =1 


(61) 


where (a) follows from the Woodbury matrix inversion identity 
Prop. 15.3]. This concludes the proof. 

Appendix D 
Proof of TheoremI 

We start by simplifying the MSE-SD defined in 

2 


as 


e d = E- 


a G T Ba d 


1 


V7 


n d - a d 


= E 11| (aG T B — I) a d ||]]| 


a 2 K 


K 


(56) 


= tr< a 2 B H G*G t + ^ R| 


B 


a 


+ hr + 1 \K. (62) 


\ k =1 

- aG T B - aB H G* 


The simplified objective function in (1621) is non-convex with 
respect to (B,a). We first show that there exists a global 
minimum for the problem (l33l > in the following lemma. 

Lemma 4: For the problem ( 1331 ). there exists a global opti¬ 
mal solution. 

Proof: The problem in ( 1331 ) is equivalent to 


mm mm 

b a ( B) 


(B, a) 


subject to tr{BB H }<A 


(63) 


and the optimal a for the inner unconstrained optimization 
problem can be readily obtained as 


(58) 


tr{G T B + B H G*} 


K/p* + tr {B^ (g*G t + Ef =1 B} 


(64) 


Then the problem (163b is equivalent to 

,d, 


min e d (B) 

B 


d 


dW* 


= ( GG h + J2 R g fc + 4 1 ) W - G* = 0 (59) 


K 


fc=1 
K 


1 

P U 


G* ^ 

tr 

{«’ 

’b + b h g*| 

] 2 

4 

K/p d + tr | 

B H 

(g-g^+e^riJ 

B ) 

>] 


fc=1 


where (a) follows from A 1 = A* for the Hermitian matrix 


subject to tr{BB ff }<iT. (65) 

The feasible set of ( |65| > given by {B : tr{BB ff } < K} 
is compact (closed and bounded), and the objective function 
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of ( l65l > is continuous over the feasible set. Thus, according 
to Weierstrass extreme value theorem ||35l Appx. E], there 
exists a global minimum for the problem ( l65k and so does 
the equivalent problem (l33k ■ 

Lemma Q] shows that there exists a global optimum for the 
problem (1331 . Note that the global optimal solution should 
satisfy the Karush-Kuhn-Tucker (KKT) necessary conditions 
l36l . In the following we will seek out all the solutions that 
satisfy the KKT conditions and identify the optimal solution 
among them. 

The Lagrangian associated with the problem (l33l > is 


C (B, ct, A) = e d + A (tr {BB H } — K) (66) 


where e d is given in (l62l >. and A is the Lagrange multiplier 
associated with the inequality constraint. 

The KKT necessary conditions for the problem (l33l > can be 
obtained as l34l 


K 


dB 


-C (B, a, A) = a 2 G*G T + ^ R| fc B 


k =1 


- aG* + AB = 0 


(B, a, A) = 2atr | B ff (g*G t + £ R| fc 

— tr |g t B + B ff G* | + = 0 

A > 0, tr {BB h } < K 
A(tr{BB ff } - K) =0. 


(67) 

1 

( 68 ) 

(69) 

(70) 


An obvious solution that satisfies the above KKT conditions 
is (a = 0, B = 0, A = 0), and the corresponding e d equals K. 
For the case with a ^ 0, (l67l) is equivalent to 


K 


A 


«• = ■ «'tE R L 5 i b 


k=l 


which leads to 

g t b = b h g* 


K 


= aB H I G*G 




E R j 

k= 1 


Combining (l72l ) with (l68l) yields 

oc K f if i 

—= Atr {BB ff } . 
pd t J 

Substituting (f73l i into (f70t . we can obtain 


A = — > °, 

P d 


tr{BB ff } = K. 
Substituting (f74b into (fTTl i yields 

B = ^f ,5 * d;T + E R i. + ^ I 'l 


k= 1 
K 


GG" + X>«, +-,1 


G 


fc=1 


(71) 


r I B. (72) 


(73) 


(74) 


(75) 


where a is chosen to satisfy the constraint tr {BB ff } = K. 

Substituting d75l) into d62k we can obtain the corresponding 
MSE-SD as 

e d = tr|l — G r ^G*G r + gR| fe + llj G* 


K 


= tr<|l-G ff (GG^ + ^R^ + ll) G 


k=l 


(76a) 


K 


-1 


-1\ 


= tr{ I + G h ^2 Rgfc + “d I ) G 


U=1 


(76b) 


where (I76ab follows from the trace identity tr {A} = tr { A 1 } 
ll34l Eq. (2.95)] and Rg fc is Hermitian, (176bb follows from 
the Woodbury matrix inversion identity l33l Prop. 15.3]. Note 
that the MSE-SD in ( I76ab is smaller than K that previously 
obtained from the solution (ct = 0, B = 0, A = 0). Therefore, 
we obtain that the precoder given by ( 1751 ) is optimal. This 
concludes the proof. 


Appendix E 
Proof of Lemma[2] 


Via invoking the matrix-valued Jensen’s inequality which 
states that EjA -1 } A (E{A}) _1 for A ^ 0 [[32I Prop. 
21.64], we can obtain 


E { gt.min } > tr 


= tr 


| (in + E |g h (R t-n,eff ) X g}) 

+ (77) 


and il satisfies that 


Mi 




= E { G h (R t-n,eff ) ' g} 
l J J i,j 

= E{gf (R t ' n - eff ) _1 g j } 

=E {C-'B., (R '-"’^)" 1 H,CL} 

— tr {C-'R, (Rt.n.eff ) -1 Kj j g (7r . _ ^ (?8) 


where (a) follows from (fl4k and (b) follows from (fl3l) . This 
concludes the proof. 


Appendix F 
Proof of Theorem[4] 

Via invoking the Schwartz inequality as in [37] Lemma 1], 
we can obtain 

e 1 * =«, {(i* + nr 1 } > g TT p- < w > 

where the equality is attained if and only if Cl is diagonal. 

Recalling Lemma [3 which states that R,R y = 0 is equiv¬ 
alent to 0(Rj,Rj) = 7 t/ 2, we only have to show that if 
R.,R, 7 = 0 for V* ^ j and tt, = nj, then il is diagonal, 
i.e., [H] i . = 0 for V* ^ j and 7Tj = 7 Tj. 
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For Vi ^ j and i u = 7ij, if = 0, then 
= tr {c^/Ri Rj } 


= tr \ R, I R, 


p p r 


— tr < RjR^ | Hi —-— I ) (R 1 


(Rf,Il, e ff) R 

“I l 


t,n,eff\ 


= 0 (80) 


where (a) follows from 

Furthermore, if R R, = 0 for V* / j and ~ 7 = tt 3 , then 
diagonal elements of fl reduces to 


1 


+ pPr 1 ' 


-l 


K 


k= 1 


-1 


^ ( R/c R fc yR k + j + ^ t I 


(81) 


via invoking (156b . This concludes the proof. 


References 

[1] L. You, X. Gao, X.-G. Xia, N. Ma, and Y. Peng, "Massive MIMO 
transmission with pilot reuse in single cell,” in Proc. IEEE Int. Conf. 
Communications (ICC), Sydney, Australia, 2014, pp. 4794^4799. 

[2] T. L. Marzetta, “Noncooperative cellular wireless with unlimited num¬ 
bers of base satation antennas,” IEEE Trans. Wireless Commun., vol. 9, 
no. 11, pp. 3590-3600, Nov. 2010. 

[3] F. Rusek, D. Persson, B. K. Lau, E. G. Larsson, T. L. Marzetta, 
O. Edfors, and F. Tufvesson, “Scaling up MIMO: Opportunities and 
challenges with very large arrays,” IEEE Signal Process. Mag., vol. 30, 
no. 1, pp. 40-60, Jan. 2013. 

[4] E. G. Larsson, O. Edfors, F. Tufvesson, and T. L. Marzetta, “Massive 
MIMO for next generation wireless systems,” IEEE Commun. Mag., 
vol. 52, no. 2, pp. 186-195, Feb. 2014. 

[5] L. Tong, B. M. Sadler, and M. Dong, “Pilot-assisted wireless trans¬ 
missions: General model, design criteria, and signal processing,” IEEE 
Signal Process. Mag., vol. 21, no. 6, pp. 12-25, Nov. 2004. 

[6] T. L. Marzetta, “How much training is required for multiuser MIMO?” 
in Proc. Annu. Asilomar Conf. Signals, Systems, and Computers (AS1LO- 
MAR), Pacific Grove, CA, 2006, pp. 359-363. 

[7] D. Tse and P. Viswanath, Fundamentals of Wireless Communication. 
New York, NY: Cambridge University Press, 2005. 

[8] B. Clerckx and C. Oestges, MIMO Wireless Networks: Channels, 
Techniques and Standards for Multi-Antenna, Multi-User and Multi-Cell 
Systems, 2nd ed. Oxford, UK: Academic Press, 2013. 

[9] J. Hoydis, S. ten Brink, and M. Debbah, “Massive MIMO in the UL/DL 
of cellular networks: How many antennas do we need?” IEEE J. Sel. 
Areas Commun., vol. 31, no. 2, pp. 160-171, Feb. 2013. 

[10] J. Jose, A. Ashikhmin, T. L. Marzetta, and S. Vishwanath, “Pilot 
contamination and precoding in multi-cell TDD Systems,” IEEE Trans. 
Wireless Commun., vol. 10, no. 8, pp. 2640-2651, Aug. 2011. 

[11] H. Yin, D. Gesbert, M. Filippou, and Y. Liu, “A coordinated approach 
to channel estimation in large-scale multiple-antenna systems,” IEEE J. 
Sel. Areas Commun., vol. 31, no. 2, pp. 264-273, Feb. 2013. 

[12] F. Fernandes, A. Ashikhmin, and T. L. Marzetta, “Inter-cell interference 
in noncooperative TDD large scale antenna systems,” IEEE J. Sel. Areas 
Commun., vol. 31, no. 2, pp. 192-201, Feb. 2013. 

[13] H. Q. Ngo and E. G. Larsson, “EVD-based channel estimation in 
multicell multiuser MIMO systems with very large antenna arrays” 
in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing 
(ICASSP), Kyoto, Japan, 2012, pp. 3249-3252. 

[14] A. Ashikhmin and T. Marzetta, “Pilot contamination precoding in multi¬ 
cell large scale antenna systems,” in Proc. IEEE Int. Symp. Information 
Theory (ISIT), Cambridge, MA, 2012, pp. 1137-1141. 


[15] H. Q. Ngo, E. G. Larsson, and T. L. Marzetta, “The multicell multiuser 
MIMO uplink with very large antenna arrays and a finite-dimensional 
channel,” IEEE Trans. Commun., vol. 61, no. 6, pp. 2350-2361, June 
2013. 

[16] K. Liu, V. Raghavan, and A. M. Sayeed, “Capacity scaling and spectral 
efficiency in wide-band correlated MIMO channels,” IEEE Trans. Inf. 
Theory, vol. 49, no. 10, pp. 2504-2526, Oct. 2003. 

[17] G. Barriac and U. Madhow, “Characterizing outage rates for space- 
time communication over wideband channels,” IEEE Trans. Commun., 
vol. 52, no. 12, pp. 2198-2208, Dec. 2004. 

[18] K. I. Pedersen, P. E. Mogensen, and B. H. Fleury, “A stochastic model 
of the temporal and azimuthal dispersion seen at the base station in 
outdoor propagation environments,” IEEE Trans. Veh. Technol., vol. 49, 
no. 2, pp. 437^147, Mar. 2000. 

[19] X. Gao, F. Tufvesson, O. Edfors, and F. Rusek, “Measured propagation 
characteristics for very-large MIMO at 2.6 GHz,” in Proc. Annu. 
Asilomar Conf. Signals, Systems, and Computers (ASILOMAR), Pacific 
Grove, CA, 2012, pp. 295-299. 

[20] M. Viberg, B. Ottersten, and A. Nehorai, “Performance analysis of 
direction finding with large arrays and finite data,” IEEE Trans. Signal 
Process., vol. 43, no. 2, pp. 469^477, Feb. 1995. 

[21] A. Adhikary, J. Nam, J.-Y. Ahn, and G. Caire, “Joint spatial division and 
multiplexing-The large-scale array regime,” IEEE Trans. Inf. Theory, 
vol. 59, no. 10, pp. 6441-6463, Oct. 2013. 

[22] A. L. Swindlehurst, E. Ayanoglu, P. Heydari, and F. Capolino, 
“Millimeter-wave massive MIMO: The next wireless revolution?” IEEE 
Commun. Mag., vol. 52, no. 9, pp. 56-62, Sept. 2014. 

[23] T. L. Marzetta, G. H. Tucci, and S. H. Simon, “A random matrix- 
theoretic approach to handling singular covariance estimates,” IEEE 
Trans. Inf. Theory, vol. 57, no. 9, pp. 6256-6271, Sept. 2011. 

[24] G. Barriac and U. Madhow, “Space-time communication for OFDM with 
implicit channel feedback,” IEEE Trans. Inf. Theory, vol. 50, no. 12, pp. 
3111-3129, Dec. 2004. 

[25] T. Kailath, A. H. Sayed, and B. Hassibi, Linear Estimation. Upper 
Saddle River, NJ: Prentice Hall, 2000. 

[26] R. Vaughan and J. B. Andersen, Channels, Propagation and Antennas 
for Mobile Communications. Milton Keynes, UK: Institution of 
Electrical Engineers, 2003. 

[27] A. Pascual-Iserte, D. P. Palomar, A. I. Perez-Neira, and M. A. Lagunas, 
“A robust maximin approach for MIMO communications with imperfect 
channel state information based on convex optimization,” IEEE Trans. 
Signal Process., vol. 54, no. 1, pp. 346-360, Jan. 2006. 

[28] X. Zhang, D. P. Palomar, and B. Ottersten, “Statistically robust design of 
linear MIMO transceivers,” IEEE Trans. Signal Process., vol. 56, no. 8, 
pp. 3678-3689, Aug. 2008. 

[29] P. Viswanath and D. N. C. Tse, “Sum capacity of the vector Gaussian 
broadcast channel and uplink-downlink duality,” IEEE Trans. Inf. The¬ 
ory, vol. 49, no. 8, pp. 1912-1921, Aug. 2003. 

[30] S. Shi, M. Schubert, and H. Boche, “Downlink MMSE transceiver 
optimization for multiuser MIMO systems: Duality and sum-MSE 
minimization,” IEEE Trans. Signal Process., vol. 55, no. 11, pp. 5436- 
5446, Nov. 2007. 

[31] Y. S. Cho, J. Kim, W. Y. Yang, and C. G. Kang, MIMO-OFDM Wireless 
Communications with MATLAB. Singapore: John Wiley & Sons (Asia) 
Pte Ltd, 2010. 

[32] B. Hassibi and B. M. Hochwald, “How much training is needed in 
multiple-antenna wireless links?” IEEE Trans. Inf. Theory, vol. 49, no. 4, 
pp. 951-963, Apr. 2003. 

[33] G. A. F. Seber, A Matrix Handbook for Statisticians. Hoboken, NJ: 
John Wiley & Sons, Inc., 2008. 

[34] A. Hjprungnes, Complex-Valued Matrix Derivatives: With Applications 
in Signal Processing and Communications. New York, NY: Cambridge 
University Press, 2011. 

[35] R. A. Horn and C. R. Johnson, Matrix Analysis, 2nd ed. New York, 
NY: Cambridge University Press, 2012. 

[36] S. Boyd and L. Vandenberghe, Convex Optimization. New York, NY: 
Cambridge University Press, 2004. 

[37] S. Ohno and G. B. Giannakis, “Capacity maximizing MMSE-optimal 
pilots for wireless OFDM over frequency-selective block Rayleigh- 
fading channels,” IEEE Trans. Inf. Theory, vol. 50, no. 9, pp. 2138-2145, 
Sept. 2004. 




